An Algorithm to Reduce the Communication Traffic for Multi-Word Searches in a Distributed Hash Table

Authors

  • Yuichi Sei
  • Kazutaka Matsuzaki
  • Shinichi Honiden
Abstract

In distributed hash tables, much of the communication traffic comes from multi-word searches. This work aims to reduce that traffic by using a Bloom filter, a space-efficient probabilistic data structure for testing whether an element is a member of a set. However, Bloom filters are of limited use when the sets involved have different numbers of elements. In the proposed method, extra data storage is generated when contents’ keys are registered in a distributed hash table system. Accordingly, we propose a “divided Bloom filter” to solve the problem of a normal Bloom filter. Using the divided Bloom filter, we aim to reduce both the amount of communication traffic and the amount of data storage.
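As an illustration of why a Bloom filter can cut traffic in a two-word search, the sketch below shows the general idea with an ordinary Bloom filter. It is not the divided Bloom filter proposed in the paper, whose details are not given in this abstract. The class `BloomFilter`, the function `two_word_search`, the filter size, and the SHA-256-based hashing are all assumptions made for this example. One node summarises its document IDs in a filter, the other node returns only the IDs that may match, and the first node then discards false positives.

```python
import hashlib


class BloomFilter:
    """A minimal Bloom filter over strings (illustrative, not the paper's design)."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int used as a bit array

    def _positions(self, item: str):
        # Derive k bit positions by salting SHA-256 with the hash index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item: str) -> bool:
        # True may be a false positive; False is always correct.
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


def two_word_search(docs_a: set, docs_b: set,
                    num_bits: int = 1024, num_hashes: int = 4) -> set:
    """Intersect two posting lists held by two hypothetical DHT nodes."""
    # Node A: summarise its document IDs in a filter and "send" it to node B.
    bloom = BloomFilter(num_bits, num_hashes)
    for doc_id in docs_a:
        bloom.add(doc_id)
    # Node B: return only the IDs that may be in A's set.
    candidates = {doc_id for doc_id in docs_b if doc_id in bloom}
    # Node A: remove false positives by checking against its real set.
    return candidates & docs_a
```

Only the fixed-size filter and the candidate list cross the network, rather than a full list of document IDs. With a single fixed filter size, a small set wastes bits and a large set raises the false-positive rate, which is the kind of limitation the abstract mentions for sets of different sizes.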

Similar resources

A Distributed Memory Algorithm for Lexicon Building

A parallel algorithm for preparing word frequency concordances over two specified sets of documents from a collection is presented. Good parallel efficiency is demonstrated on a 128-node distributed memory machine using sets whose combined size exceeds one gigabyte. It is demonstrated that efficiency is heavily influenced by hashing and communication strategies. A two-stage hashing algorithm is propo...

P2P Traffic Optimization based on Congestion Distance and DHT

P2P traffic has had a serious impact on the bearer network, and how to utilize network bandwidth has become a common concern for both the Internet Service Provider (ISP) and the P2P user. A P2P traffic optimization method based on congestion distance and DHT is proposed in this paper. The P2P traffic optimization model incorporates the aims of both the ISP and the P2P user, and the cost of congestion ...

Self-Correcting Broadcast in Distributed Hash Tables

We present two broadcast algorithms that can be used on top of distributed hash tables (DHTs) to perform group communication and arbitrary queries. Unlike other P2P group communication mechanisms, which either embed extra information in the DHTs or use random overlay networks, our algorithms take advantage of the structured DHT overlay networks without maintaining additional information. The pr...

Improving Updates in a Near-Real-Time Data Warehouse

A near-real-time data warehouse gives end users the essential information they need to make appropriate decisions. The fresher its data, the better the resulting decisions. To keep the data fresh and up to date, changes on the source side must be added to the data warehouse with little delay. For this reason, they should be transformed into the data wareh...

Decentralization of a Multi Data Source Distributed Processing System Using a Distributed Hash Table

A distributed processing system (DPS) contains many autonomous nodes, which contribute their own computing power. DPS is considered a unified logical structure, operating in a distributed manner; the processing tasks are divided into fragments and assigned to various nodes for processing. That type of operation requires and involves a great deal of communication. We propose to use the decentral...

Publication date: 2006